Picture for Masashi Sugiyama

Masashi Sugiyama

Tokyo Institute of Technology

Bifrost: Steering Strategic Trajectories to Bridge Contextual Gaps for Self-Improving Agents

Add code
Feb 05, 2026
Viaarxiv icon

Causal Graph Learning via Distributional Invariance of Cause-Effect Relationship

Add code
Feb 03, 2026
Viaarxiv icon

Positive-Unlabeled Reinforcement Learning Distillation for On-Premise Small Models

Add code
Jan 28, 2026
Viaarxiv icon

Scalable Oversight via Partitioned Human Supervision

Add code
Oct 26, 2025
Viaarxiv icon

LLM Routing with Dueling Feedback

Add code
Oct 01, 2025
Viaarxiv icon

Generalized Linear Bandits: Almost Optimal Regret with One-Pass Update

Add code
Jul 16, 2025
Viaarxiv icon

Non-stationary Online Learning for Curved Losses: Improved Dynamic Regret via Mixability

Add code
Jun 12, 2025
Viaarxiv icon

On Symmetric Losses for Robust Policy Optimization with Noisy Preferences

Add code
May 30, 2025
Viaarxiv icon

Practical estimation of the optimal classification error with soft labels and calibration

Add code
May 27, 2025
Viaarxiv icon

Domain Adaptation and Entanglement: an Optimal Transport Perspective

Add code
Mar 11, 2025
Figure 1 for Domain Adaptation and Entanglement: an Optimal Transport Perspective
Figure 2 for Domain Adaptation and Entanglement: an Optimal Transport Perspective
Figure 3 for Domain Adaptation and Entanglement: an Optimal Transport Perspective
Figure 4 for Domain Adaptation and Entanglement: an Optimal Transport Perspective
Viaarxiv icon